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(57) A process for selecting, in a content delivery 
network having at least two surrogate servers delivering 
content over a network, a surrogate server adapted to 
deliver content requested by a user. First, the network 
is divided into subnetworks; upon request of the user, 
the process finds (4) a subnetwork of the user. Based 
on this subnetwork, the process then determines (8) the 
surrogate server closest to the subnetwork of the user. 
The process then checks (1 0) whether the closest sur- 
rogate server has the content requested by the user. If 
this is the case, the closest surrogate server is selected. 
Else, the process loops by passing to the next closest 
surrogate server, and checking whether it has the con- 
tent requested by the user. 
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Description 

[0001] The invention relates to the field of networks, 
and more specifically to content delivery networks. 
[0002] The World Wide Web (Web) uses the stateless 
hypertext transfer protocol (HTTP) to allow a client to 
request resources from a server over a network. The 
HTTP protocol enables the transparent navigation from 
Web servers to Web servers through hypertext links. A 
hypertext link makes an association between a part of 
text and a URL (Uniform Resource Locator). A resource 
itself can contain multiple resource locators. A Web 
browser that runs on the machine of the client sends a 
HTTP GET message to a Web server running on the 
machine of the server. The address of the resource on 
the server is determined by a Uniform Resource Locator 
(URL) from the form "<protocol>://<server-address>/ 
<path>/<content>". URLs with other protocols than HT- 
TP are typically served by specialized servers (such as 
FTP servers, streaming media servers, etc.). The pro- 
tocol HTTP is a client-server protocol, which uses a TCP 
network connection. How the data is interpreted by the 
browser depends on the Ml ME type of the returned data; 
other protocols than HTTP are usually handled by plug- 
ins that are called by the browser. 
[0003] Resources that are heavyweight in relation to 
text based resources, such as images, audio, video, and 
large files, can place a burden for the server, especially 
if they are often requested. Content Delivery Networks 
(CDN) replicate resources to distribute the load to mul- 
tiple surrogate servers; a CDN thus comprises a number 
of surrogate servers, in which resources are replicated, 
and a redirection server for managing, upon request of 
a user, which of the surrogate servers should deliver the 
resources. 

[0004] The main difference between known CDN 
techniques is their mechanism to choose the surrogate 
server that serves the resource. US-B-6 185 598 sug- 
gests providing reflector mechanisms for intercepting 
resource requests made by clients and selectively re- 
flecting the requests to repeaters. The reflectors select 
a best repeater from a set of possible repeaters and re- 
direct the client to the selected best repeater. The client 
then makes the request to the selected best repeater. 
For selecting the best repeater, the network is parti- 
tioned into cost groups; according to the user, a set of 
"low cost" repeaters is selected, and one repeater is se- 
lected at random among the set. If the requested re- 
source is not in the selected repeater, the repeater cop- 
ies it from the origin server, and saves a local copy of 
the resource in order to serve subsequent requests. 
[0005] US-A-6 1 08 703 discusses a network architec- 
ture for a content provider, having a content provider site 
and a number of ghost servers. It suggests serving a 
base HTML portion of a web page from the content pro- 
vider site, and serving the embedded objects for the 
page from the ghost servers located near to the client 
machine. This document suggests balancing load 



among ghost servers. Replication of objects from the 
content provider site to the ghost servers is carried out 
in a demand-based pull fashion : if a copy of a requested 
object is not present on the ghost server located near to 

5 the user, the ghost server retrieves a copy of the object 
from the content provider site or from another ghost 
server. Apart from this process for replicating resources, 
the described method uses dynamic DNS redirection to 
select a surrogate server close to the end-user. The 

10 main disadvantage of DNS redirection is that the result 
might not be accurate for the requester, since the DNS 
server used by the end-user's browser is not close to 
the end-user. Moreover, with dynamic DNS it is not pos- 
sible to get the context of the end-user. 

15 [0006] US-A-6 052 71 8 discusses replica routing, that 
is automatically redirecting client computers that re- 
quest a service to a server replica for this service. The 
server replica selected is the one expected to provide 
the best performance to the client, based upon the cli- 

20 ent's location in the internetwork topology and the esti- 
mated performance of the internetwork. The described 
method uses real-time measurements, when an end-us- 
er requests a resource to calculate the nearest replica 
server. The end-user's browser is redirected by a replica 

25 router to the replica server by receiving from the de- 
scribed method an HTTP REDIRECT answer. Thus the 
end-user's browser has to connect first to the to the rep- 
lica router and then a second time to the replica server. 
This may cause unnecessary latency. This document 

30 further suggests that server replicas contain the same 
basic information, but be specialized with local features, 
e.g. language features or buyer-location specific prices. 
In this case, the server replica is not selected according 
to the request of the client or according to the nature of 

35 the resource, but rather according to the geographical 
area, so that client routing is based on content special- 
ization, but not on proximity. 

[0007] US-A-6 003 030 suggests replicating informa- 
tion among "smart mirror" sites. Every user is assigned 

40 to a specific delivery site based on an analysis of net- 
work performance with respect to each of the available 
delivery sites. The described method relies on special 
software that is installed at the end-user terminal, to 
choose the best surrogate server. This document does 

45 not specifically discuss the content of the "smart mirror" 
sites, the implicit assumption being that all mirror sites 
of the same content provider contain the same informa- 
tion. 

[0008] US-A-5 870 546 discusses redirection meth- 
50 ods. These are used for tracking of a client reference to 
a resource locator. However, this document does not 
discuss how redirection methods could be applied to 
content delivery networks, or to the selection of a surro- 
gate server in such a CDN. 
55 [0009] M. A. Ruiz-Sanchez, Ernst W. Biersack, and 
Walid Dabbous, Survey and Taxonomy of IP Address 
Lookup Algorithms, IEEE Network Magazine, 15(2):8~ 
23, March/April 2001 discusses the problem of IP ad- 



2 



3 



EP 1 322 094 A1 



4 



dresses lookup. The algorithms disclosed in this docu- 
ment are applied to the routing of IP packets in a router. 
[0010] In view of this prior art, there is a need for a 
method for selecting in a content delivery network, the 
"best" surrogate server for serving requests to users. 
Such a method should be fast, easy to implement; pref- 
erably, it should not necessitate additional software or 
hardware at the client's side or additional hardware with- 
in the network. The method should serve the users as 
rapidly as possible, and should not cause unnecessary 
latency. 

[0011] According to the invention, there is provided a 
process for selecting, in a content delivery network hav- 
ing at least two surrogate servers delivering content 
over a network, a surrogate server adapted to deliver 
content requested by a user, the process comprising the 
steps of: 

defining at least two subnetworks in the network, 
and upon request of the user, finding a subnetwork 
of the user; 

determining the surrogate server closest to the sub- 
network of the user; 

checking whether the closest surrogate server has 
the ability to serve the content requested by the us- 
er, and 

if the closest surrogate server has the ability to 
serve the content requested by the user, select- 
ing the closest surrogate server; 
else, if the closest surrogate server does not 
have the ability to serve the content requested 
by the user, disregarding the closest possible 
surrogate server and repeating the step of de- 
termining and checking. 

[0012] In a preferred embodiment, the step of finding 
a subnetwork is carried out using an algorithm with a 
time complexity equal to or less than O(W), with W the 
length of an address in the network. 
[0013] The step of determining may comprise 

measuring a distance between each surrogate 
server and each subnetwork; 
according to the subnetwork of the user, selecting 
a surrogate server with the smallest distance. 

[0014] In this case, a distance between a surrogate 
server and a subnetwork may be selected from: 

a number of hops within the network from the sur- 
rogate server and a representative of the subnet- 
work; 

a round trip time within the network from the surro- 
gate server to a representative of the subnetwork; 
a parameter entered manually or read from a con- 
nection table; 

available or bottleneck bandwidth within the net- 



work from the surrogate server and a representative 
of the subnetwork; 

costs to reach a representative of the subnetwork 
from the surrogate server. 

5 

[0015] Distances may be stored in a table with a sub- 
network identifier as key. In this case, an entry in the 
table is preferably a list of surrogate servers identifiers, 
ordered by distance to the subnetwork identified by the 
10 key to the entry. 

[0016] In another embodiment, the step of determin- 
ing is carried out using an algorithm with a space com- 
plexity equal or less than 0(N*(S+1 )), with N the number 
of subnetworks and S the number of surrogate servers. 
15 [0017] The step of checking may comprise 

storing for each surrogate server identifiers of the 
content contained in the surrogate server; 
mapping an identifier of the content requested by 
20 the user; and 

comparing the identifier of the content requested by 
the user and the identifiers of the content contained 
in the closest surrogate server. 

25 [0018] In this case, identifiers may be stored in a table 
with the content identifier as key. Each entry can be a 
bit array, one bit in the array being representative of the 
presence in a surrogate server of the content corre- 
sponding to the key to the entry. 
30 [0019] According to another feature of the invention, 
the step of checking is carried out using an algorithm 
with a space complexity equal or less than 0(C*S), with 
C the number of content and S the number of surrogate 
servers. 

35 [0020] Preferably, a request from a user contains an 
identifier of a set of real contents, each of said real con- 
tent being contained in the content delivery network; in 
this case, the process further comprises, before the step 
of checking, a step of selecting a real content of the set 
40 according to the request of the user, and the step of 
checking comprises checking whether the closest sur- 
rogate server has the selected real content. In this case, 
the step of selecting a real content is preferably carried 
out according to information related to the user. This us- 
45 er-related information related to the user may be select- 
ed among: 

technical characteristics of a device of the user; 
a location of the user; 
50 - past behaviour of the user; 
a subnetwork of the user; 
identity of the user. 

[0021] Real contents within a set may represent: 

55 

the same content with a different video quality or a 

different audio quality; 

the same content in different languages; 
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advertisements; 
regionalized advertisements, 
personalized advertisements 
independent advertisements. 

5 

[0022] In another embodiment of the invention, the 
network comprises a private network, addresses of 
which are hidden from the rest of the network. The pri- 
vate network contains at least one surrogate server, and 
the process comprises 10 

providing a local redirection server within the private 
network; 

carrying the steps of defining, determining and 
checking within the local redirection server, and 15 
if no surrogate server is selected within the private 
network, repeating the steps of defining, determin- 
ing and checking within a redirection server of the 
content delivery network outside of the private net- 
work. 20 

[0023] In yet another embodiment of the invention, the 
network comprises a private network, addresses of 
which are hidden from the rest of the network. The pri- 
vate network contains at least one surrogate server, and 25 
the process comprises 

providing a local redirection server within the private 
network; 

upon request of the user, having the local redirec- 30 
tion server forward an address of the user in the pri- 
vate network to a redirection server of the content 
delivery network outside of the private network. 

[0024] The steps of defining, determining and check- 35 
ing may then be carried out in the redirection server out- 
side of the private network. 

[0025] The invention also proposed a process for se- 
lecting, in a content delivery network having at least two 
surrogate servers delivering content over a network, a 40 
surrogate server adapted to deliver content requested 
by a user, the process comprising the steps of: 

defining at least two subnetworks in the network, 
and upon request of the user, finding a subnetwork 45 
of the user; 

determining the surrogate servers having the ability 
to serve the content requested by the user; 
selecting among the surrogate servers thus deter- 
mined the surrogate server closest to the subnet- 50 
work of the user. 

[0026] The invention also propose a computer pro- 
gram product adapted to carry out this process, as well 
as a content delivery network with a redirection server 55 
and at least one surrogate server, the redirection server 
comprising computer programs means for carrying out 
the process. 



[0027] A network embodying the invention will now be 
described, by way of nonlimiting example, and in refer- 
ence to the accompanying drawings, where : 

figure 1 is a flowchart of the steps of a process ac- 
cording to an embodiment of the invention; 
figure 2 is a description of the invention in terms of 
input and output of information; 
figure 3 is a flowchart of the meta-content lookup 
process in one embodiment of the invention; 
figure 4 is an example of a table used for selecting 
a surrogate server closest to a subnetwork, in one 
embodiment of the invention; 
figure 5 is an example of a table used for checking 
whether a surrogate server has a selected content, 
in one embodiment of the invention; 
figure 6 is a schematic view of a network architec- 
ture where the invention may be carried out; 
figure 7 shows the structure of an object in an em- 
bodiment of the invention; 

figure 8 is an example of meta-content identification 
in an embodiment of the invention; 
figures 9 to 11 show objects in one embodiment of 
the invention. 

[0028] The invention proposes a method for selecting 
a surrogate server, in a content delivery network. It is 
intended to select the "optimal" surrogate server for the 
requested resource, for a request from a given user (or 
client). The word "optimal" reflects the capacity of the 
CDN to provide the requested resource to the user, as 
fast and as conveniently as possible for the user and, in 
the case of streams, as uninterrupted as possible. 
[0029] In the rest of the description, the invention is 
discussed in reference to an embodiment of the inven- 
tion on the Internet, which also incorporates means to 
adapt the content for each viewer following given rules. 
This makes it possible to deliver different contents to dif- 
ferent users although they may have issued a request 
for the same resource. As shown in the examples below, 
the contents may differ according to various criteria, 
such as location of the user in the network, past behav- 
iour of the user, subnetwork of the user, identity of the 
user or the technical characteristics of the device of the 
user, including speed of the device used by the user for 
connecting to the network or display device capabilities 
of the end user. This feature of the invention is however 
not compulsory, and selection of a surrogate server may 
be carried out without embodying this feature. For the 
sake of brevity, this feature is included in the description 
of the embodiments. 

[0030] For a full understanding of this feature of the 
invention, use is made of the word "meta-content". A 
meta-content is an identifier of a set of "real" contents, 
that is of contents actually provided by the CDN. The 
process does not enforce any constraint on how to 
group the real contents to obtain a meta-content. The 
way to group contents depends on the desired function- 
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ality. For instance: 

in case of economic broadband, a meta-content 
could represent a set of content with the same in- 
formation but with different quality - such as a movie 
at low, medium, and high quality; 
in case of an enterprise, a meta-content could rep- 
resent different news of the day depending of the 
department the user is affiliated with; 
in case of advertisement insertion, the meta-con- 
tent represents a set of advertisements. 

[0031] The idea behind the notion of meta-content is 
that whereas only one URL needs to be maintained (one 
URL per meta-content), an end user can receive differ- 
ent content through this URL depending on a policy de- 
fined by the media manager. This policy depends on in- 
formation regarding the user. For instance, in the case 
of economic broadband, the real content delivered to the 
end user may depend on the speed of the device 
through which the user connects to the network. In the 
case of advertisement insertion, the advertisement 
served on the user would depend on the past behaviour 
of the user, as discussed below in the example of a car- 
rousel of advertisements. 

[0032] As explained below, due to its highly modular 
structure, the invention allows for customization; thus 
the invention allows the support of various business 
models. The invention can be used to redirect to any 
resources a CDN might support, but is very well suited 
to support CDNs for streaming media. Streaming media 
are resources that are not downloaded to the end-user, 
but that are transmitted continuously from the server to 
the client until the resource was fully streamed or the 
viewer stops the receiving player. Streaming media is 
especially sensitive to interruptions through network 
congestion and packet losses. Streaming the media 
from the closest surrogate server minimizes the risks of 
loosing packets and of network congestion on the path 
of the streaming media packets. 
[0033] Figure 1 is a flowchart of the steps of a process 
according to an embodiment of the invention. Figure 2 
is a description of the invention in terms of input and 
output of information. The method is carried out upon 
receiving a request from a user. As discussed, it is as- 
sumed that this request is issued for a meta-content, 
which identifies a set of real content. 
[0034] The request is issued by the user simply by se- 
lecting one URL for a given meta-content. When an end 
user click on this URL, the redirection server of the CDN 
is provided with the information displayed in box 20 of 
figure 2, that is 

IP A , IP address of the end user A; 
MC A , meta-content requested by user A; 
l A , information on the user A. 

[0035] This is shown in step 2 of figure 2, where the 



redirection server receives the information from the us- 
er. It is also exemplified in figure 2 by the various arrows 
going from box 20 to box 22, which symbolizes the re- 
direction server. 
5 [0036] Based on the information thus received, in a 
first step of the process, represented at 4 in figure 1 or 
at 24 in figure 2, is the finding of a subnetwork of the 
user. A subnetwork is defined in the present description 
as one element of a partition of the network; one may 
10 simply define a subnetwork as a set of IP addresses. 
The set of IP addresses can be aggregated into intervals 
if the subnetwork owns all IP addresses within this in- 
terval. This aggregation is important within the Internet 
to keep the routing tables small. The most often used 
15 aggregation is the CIDR format, see M. A. Ruiz- 
Sanchez et al. for details. The algorithm for finding a 
subnetwork in this step is discussed at length below. Af- 
ter this step, there is provided a subnetwork identifier, 
representative of the subnetwork to which the user be- 
20 longs. In the example, the subnetwork is referenced 
Subnetwork A in figure 2 in box 26. The arrow in figure 2 
from box 24 to box 26 simply indicates that the subnet- 
work is identified in the first step. 
[0037] Once the subnetwork is identified, the process 
25 proceeds to step 6 of figure 2, where there is selected 
a content among the set of real content identified by me- 
ta content MC A . This step is represented in box 28 of 
figure 2. Once again, this step could be skipped, if the 
user issues a request to a given content. 
30 [0038] Selection of a content is discussed below in 
reference to the examples; it is sufficient here to clarify 
that the selection may depend on various information. 
As shown by the arrow joining MC A to box 28, selection 
of the content will first depend on the requested meta- 
35 content, which identifies the set of possible contents. 
Second, selection of the content may depend on infor- 
mation received from the end-user, as shown by the ar- 
row joining l A to box 28 in figure 2. Last, selection of the 
content may also depend on the location of the user in 
40 the network; one easy way to take this into account is 
to consider the subnetwork to which the user belongs. 
This is symbolized in figure 2 by the arrow joining boxes 
26 and 28. After this step, there is provided a specific 
content, which is actually stored in at least one surrogate 
45 server of the CDN. This specific content is the "best" 
adapted content for the user, in the sense of the rules 
defined by the media manager in the CDN. It is referred 
to in figure 2 as C A , and is represented in box 40, which 
contains the output from the redirection server. 
50 [0039] The process then passes to step 8 of figure 1 , 
box 30 of figure 2. In this step, the surrogate server clos- 
est to the subnetwork is determined. The "closest" serv- 
er is defined in terms of the network topology, and in the 
sense of the metric used to measure the distance be- 
55 tween surrogate servers and the subnetwork to which 
the end users belong. Examples of distances comprise 

a number of hops within the network from the sur- 
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rogate server and a representative of the subnet- 
work; 

a round trip time within the network from the surro- 
gate server to a representative of the subnetwork; 
a parameter entered manually or read from a con- 
nection table; 

network capacities within the network from the sur- 
rogate server and a representative of the subnet- 
work; 

costs to reach a representative of the subnetwork 
from the surrogate server. 

[0040] The representative of the subnetwork may 
simply be one node of the subnetwork. It is preferably 
selected so that the distance (as defined above) from 
the outside to the representative is more or less the 
mean distance from the outside to all members of the 
subnetwork. 

[0041] Algorithms for determining the closest surro- 
gate server are discussed below. As exemplified by the 
arrow joining box 26 and box 30 in figure 2, the input in 
this step is mainly the identifier of the subnetwork. After 
this step, there is provided a current closest surrogate 
server. 

[0042] In the next step 10, it is checked whether the 
current closest surrogate server has the ability to serve 
the content requested by the user. In case one does not 
use meta-content, this step amounts to checking wheth- 
er the current closest surrogate server has the content 
actually requested by the user. Otherwise, where one 
uses meta-content, this step amounts to checking 
whether the current closest surrogate server has the ac- 
tual content C A selected in step 6, and is available for 
serving this content. This is shown in figure 2 by the ar- 
row joining C A to box 30. Algorithms for carrying out this 
step are discussed below. 

[0043] If the answer in step 10 is "yes", then the cur- 
rent closest surrogate server has the relevant real con- 
tent C A and may accept the client request; the process 
is terminated (step 12). In figure 2, this is represented 
by the output of IP Ss , that is by the output of the IP ad- 
dress of the optimal surrogate server. One should note 
that the selected surrogate server is also the closest 
possible one, in the sense of the distance used in the 
process. 

[0044] Otherwise, if this current closest surrogate 
server cannot serve the actual content C A , the process 
passes to step 14. In step 14, the current closest surro- 
gate server is disregarded, and the process passes 
back to step 8. This means that step 8 will provide the 
next closest surrogate server - in terms of the distance 
used in the process. 

[0045] Thus, the method terminates when a surrogate 
server is found; although this is not represented in figure 
1 , the method would also terminate if no more surrogate 
servers exist within the CDN, or if there is no surrogate 
server that is able to serve the best adapted real content. 
[0046] In the description of step 10, it is checked 



whether the surrogate server has the content requested 
by the user, and whether it is available for serving this 
content; one may also carry out the checking of availa- 
bility of the server separately, or simply not check the 

5 availability of the server. The advantage in checking 
separately the availability of the surrogate server is that 
this availability may change often, while the content of 
the surrogate server is likely to be more stable. It is also 
possible that the server be available, but that restrictions 

10 exist regarding maximum bandwidth usage; this could 
also be construed as checking the "availability" of the 
surrogate server. 

[0047] Again, one should understand from the consid- 
eration of figures 1 and 2 that the proposed method 

15 would also operate even if the user did not issue a re- 
quest for a meta-content, but simply a request for a spe- 
cific "real" content actually contained in the CDN. In 
such a case, step 6 of figure 1 would simply be skipped. 
[0048] One may also depart from the order of step dis- 

20 closed in reference to figures 1 and 2. For instance, one 
may first scan surrogate server to identify those that 
have the requested content, and then select the closest 
surrogate server among this set. In figure 1, this 
amounts to carrying steps 4 and 8 immediately before 

25 step 12. 

[0049] The various possible algorithms usable in 
these steps are now discussed. Since the method is 
aimed to be executed for each request for a content that 
is managed by the CDN the calculation must be as fast 

30 as possible. The algorithms for each step may be opti- 
mized to run entirely in main memory of a standard PC. 
Moreover, the algorithms now discussed are fast and 
only require a few memory lookups. While the preferred 
embodiment runs on a single computer, the four steps 

35 can also be deployed on different computers connected 
over a network to handle more requests in parallel. 
[0050] The following four algorithms may be used to 
calculate the best adapted real content, and the address 
of the closest surrogate server to the requesting end us- 

40 er, which has this real content. The algorithms may work 
on data that is read in at initialization time. The data is 
stored by each algorithm in data structures that allow 
efficient lookups and efficient updates within the main 
memory, as discussed below. 

45 [0051] At initialization, the following data is provided : 

a list of subnetworks and a unique identifier for each 
subnetwork; as discussed above, subnetworks may 
be defined as IP address ranges. This list can be 
50 obtained e.g. from the routing tables of border In- 
ternet routers or route registries, or set by the net- 
work provider; 

a matrix that contains for each meta-content the list 
of the real contents associated to it, and the type of 
55 the object that will store the real contents and that 
will define the rules for the choice of the best adapt- 
ed real content; 

a matrix that assigns for each deployed surrogate 
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server a weight to reach the subnetwork. The 
weight can be e.g. the distance in hops from the sur- 
rogate server to a representing computer of the sub- 
network, or another of the distances discussed 
above; 

a matrix that contains for each surrogate server all 
content identifiers for the content that is accessible 
by the surrogate server; 

a table that stores the current state of each surro- 
gate server within the content delivery network; this 
table, when available, makes it possible to take into 
account the current status of the surrogate servers, 
and provides fail safe capabilities. 

[0052] The data can be stored in an external database 
that is read by the algorithms of the invented method at 
initializing time; it may be stored in different places if the 
four algorithms are carried out in different places. If the 
state in the content delivery system changes, the inter- 
nal data may be changed accordingly. As example, the 
following state changes, within the CDN, may result in 
a update of the data: insertion, changes to, deletion of 
new content; failing of a surrogate server, changes in 
the list of subnetworks, changes in the weights for a sur- 
rogate server to reach a subnetwork. 
[0053] The following data structures are created by 
the invented method, in its preferred embodiment, ac- 
cording to the initial data: 

a binary tree is used to store the subnetworks infor- 
mation in order to perform route lookups, or any oth- 
er data structure as proposed in M.A. Ruiz Sanchez 
etal.; 

a table, preferably a hash table, which keys are the 
meta-content identifiers and which values are the 
object corresponding to the meta-content. Each ob- 
ject contains a set of real contents and rules to 
chose which real content is best adapted for which 
end user; 

a table, preferably a hash table which keys are the 
subnetwork identifiers, and the values are ordered 
lists of all the surrogate servers, from the closest 
surrogate server for this subnetwork to the farthest 
surrogate server for this subnetwork; 
a table, preferably a hash table which keys are the 
real content id, and which values are the bit array. 
The n th bit set to 1 means that the surrogate server 
with id n is healthy and can serve the content. The 
n th bit set to 0 means that the surrogate server with 
id n cannot serve the content. 

[0054] Based on the data thus stored, the preferred 
embodiment of the four algorithms is described below. 
They make it possible to select the best-adapted real 
content and the optimal surrogate server, for the re- 
quester, which has this real content. 
[0055] As discussed above in step 4, the first algo- 
rithm researches the smallest subnetwork of IP ad- 



dresses from all subnets stored in a database. The prob- 
lem of finding the smallest subnetwork is identical to 
finding the best matching prefix of a subnet within IP 
address lookup algorithms used within IP routers. See 
5 M.A. Ruiz Sanchez et al. for a survey and taxonomy of 
known IP address lookup algorithms. The input of the 
first algorithm is the IP address of the end-user's com- 
puter (or its gateway, if the computer is within a private 
network, as in enterprises for example). In the preferred 
10 embodiment, the subnetworks are stored in the CIDR 
notation in a database and uniquely enumerated and 
loaded at initialization time into the data structure need- 
ed by the first algorithm. In the preferred embodiment, 
we choose the binary tree algorithm, which is the most 
15 basic algorithm. Faster algorithms like binary search on 
prefix length could be taken instead, see M.A. Ruiz 
Sanchez et al. for different algorithms. Typical routing 
databases (such as the route table of a backbone router) 
can be stored with the discussed algorithms in main 
20 memory; the size of a binary tree in memory is in O(NW) 
in the worst case, where N is the number of given sub- 
networks and W is the length of the longest prefix (32 in 
Ipv4). A typical number for N is about 100,000 entries. 
The exact size of the structure highly depends on the 
25 given subnetworks and their prefix length. In the worst 
case, finding a subnetwork within this structure has a 
time complexity in O(W). 

[0056] As discussed above, the output of the first al- 
gorithm is the number or identifier of the smallest sub- 
30 network in which the IP address of the end-user's com- 
puter resides. One may also use algorithms other than 
the one discussed above. 

[0057] As discussed above in step 6, a real content 
may be selected, in case the user's request only issues 
35 a meta-content - an identifier to a set of real contents. 
The algorithm carried out in this step returns the real 
content id according to the meta-content id and to infor- 
mation related to the end user. This algorithm takes as 
input the meta-content id, and additional information on 
40 the end user, as discussed in relation to figure 2. The 
output is the real content id. 

[0058] In the preferred embodiment, this algorithm 
works on a data structure that is a table, preferably a 
hashtable with the meta-content id as key and a set of 
45 objects containing for a given meta-content a set of real 
content as values. A lookup operation in a hashtable has 
constant time complexity. Figure 3 shows the meta-con- 
tent lookup process. This lookup process remains very 
general and the specific part in the meta-content lookup 
50 to each problem is contained in the object definition. In- 
deed, each object is a small data structure containing 
all the possible real contents corresponding to the given 
meta-content. The exact structure of this object de- 
pends on the functionality required. We give below sev- 
55 eral examples of such objects for economic broadband 
and for advertisement insertion. Each time there is a 
new problem that requires a mapping of a meta-content 
to several real contents, a new meta-content object can 
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be created. Figure 3 shows at 44 the input to the algo- 
rithm, that is the meta-content identifier MC A . It is used 
as an entry to the hash table 46, and this return an object 
real content 48. Based on the information 50 regarding 
the user - exemplified by l A and the subnetwork identi- 
fier, as in figure 3 - the real content C A 52 is provided. 
[0059] The size in memory of the meta-content table 
is in O(M), where M is the number of meta-content en- 
tries. The lookup speed in this meta-content table is in 
0(1). As this table simply maps a meta-content id to an 
object, there are additional memory space and lookup 
speed requirements depending on the object type. The 
invention is not restricted to any kind of object. 
[0060] As discussed in numerous instances above, in 
case a content does not require a meta-content lookup, 
this content will be mapped to the identity object, i.e. the 
object that returns for each meta-content, one and only 
one real content. 

[0061 ] As discussed above in step 8, the process then 
selects the closest - in terms of the underlying topology 
and the selected distance - surrogate server to the sub- 
network found by the algorithm of step 4. This algorithm 
knows the distances between all surrogate servers and 
the subnetworks in the database. In the preferred em- 
bodiment, this algorithm works on a data structure that 
is a table, preferably a hashtable with the subnetwork 
identifier as key and an ordered list of surrogate server 
identifiers as value. The list of surrogate server identifi- 
ers is ordered from the smallest to the longest distance 
between the surrogate servers and this subnetwork. 
This algorithm returns the first entry of this list when it 
is first called, the second of the list in a second call, and 
so on; this makes it possible to implement very easily 
the loop of steps 8, 1 0 and 1 4 of figure 1 . The algorithm 
returns a special entry that denotes that no surrogate is 
available, if no more entries are available; this makes it 
possible to interrupt the loop, as discussed above. 
[0062] An example of such a hash table is given in 
figure 4, where the first column represents the entry key 
of the hash table (Subnet), and where each line repre- 
sents the ordered list of the surrogate server (SS) for 
the subnetwork on the same line. As an example, for 
1000 surrogate servers, 100 000 subnetworks, 2 Bytes 
id for the surrogate servers identification, and 4 Bytes 
id for the subnetworks identification, the hash table size 
is: 1 0O'OOO * 1 000 * 2 + 1 OO'OOO * 4 = 1 91 MBytes. We 
note that this table could be easily compressed. Indeed, 
several subnetworks can have exactly the same posi- 
tion to all the surrogate servers. In figure 4, we see this 
is the case for Subnet 1 and Subnet 4 , both subnetworks 
have exactly the same position to all the surrogate serv- 
ers. Therefore, only one entry for both subnetworks is 
needed. By removing all the duplicate lines in the hash 
table, and by letting several subnetworks point to the 
same line, we can significantly reduce the size of the 
table. To find the closest (or n th closest) surrogate server 
for a given subnetwork requires one hash table lookup 
plus one lookup in an array. Both lookup operations 



have constant time complexity. When a change con- 
cerns a subnetwork, the update in this hash table is very 
efficient and simple, as we simply have to update a line 
in the hash table. When a surrogate server is added or 

5 removed, we need to change all the lines. However, 
these changes can be made line by line. 
[0063] The size in memory of this table, used to map 
the closest surrogate server to the subnetwork of the 
end user, is in 0(N*(S+1 )) where N is the number of giv- 

10 en subnetworks and S is the number of surrogate serv- 
ers. Finding the closest surrogate server requires a 
lookup into the table, using the identifier of the subnet- 
work as an entry. 

[0064] A solution for building up the table used in this 

15 step is now discussed. First, a distance is selected, as 
explained above; a measurement method may then be 
chosen. In the preferred embodiment one may use the 
method disclosed in European patent application n°01 
402 789.0 filed on October 26, 2001 and entitled " A 

20 process for measuring distance in a packet network with 
a plurality of routers"; this method returns the number 
of hops between the measuring site and the measure- 
ment target. The traceroute tool can be used for this pur- 
pose, but at the expense of a significant overhead in 

25 term of network traffic and time. 

[0065] For each subnetwork, a representative of this 
subnetwork is chosen for the measurement. This repre- 
sentative can be chosen according to a database of web 
servers, or in any other way that returns a valid repre- 

30 sentative, i.e. responding to the measurement protocol. 
The representative preferably has, as regards distance, 
a behaviour similar to the mean behaviour of the mem- 
bers of the subnetwork. 

[0066] To each surrogate server is sent the list of the 
35 representatives of all the known subnetworks. Each 
time a measurement between a surrogate server and a 
subnetwork target is completed, the table that gives the 
distance between each surrogate server and each sub- 
network is updated. The measurement process, in this 
40 example, is carried out from the surrogate server, since 
the measurement tool is adapted for use from the point 
from which distance is measured. Of course, in other 
instances, e.g. if the distance is a parameter set manu- 
ally, there would be no need to provide the surrogate 
45 servers with the list and to proceed with measurements 
from the surrogate servers. 

[0067] When an end user comes from a subnetwork 
without a valid representative and thus has not yet been 
measured, the process may choose randomly a surro- 

50 gate server for this end user, but immediately add this 
end user as the representative of this subnetwork, and 
trigger the measurements from the surrogate servers to 
this representative. This self-learning capability allows 
for a high relevance of the measurement information 

55 and ensures that over time all end users that requests 
content from this CDN are in measured subnetworks. 
Still, it does not penalize access to the resource by a 
new end user. 
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[0068] As discussed above in relation to step 1 0, the 
process verifies that the requested content is actually 
accessible from the closest surrogate server. If the con- 
tent is not available on this surrogate server, the process 
goes back to step 8 through step 14, thus choosing the 
next best surrogate server; if the content is available, 
the process terminates (step 1 2) and returns the chosen 
surrogate server identifier. 

[0069] In the preferred embodiment, the database 
holds for all surrogate servers the stored content iden- 
tifiers. The algorithm loads at initialization time this data 
from the database and stores it in an internal data struc- 
ture. One memory efficient implementation for this data 
structure is a hashtable with the content identifier as key 
and a bit array as value. For the bit array, it is useful to 
have the identifiers of the surrogate servers as unique 
integer values in the range from 1 to the number of de- 
ployed surrogate servers. So, the bit on position n de- 
notes whether the surrogate server with the identifier n 
has the content, the bit is set to 0 if the surrogate server 
does not have the content, to 1 if it has the content. An 
example of such a hash table is given in figure 5, where 
the first column represents the entry key of the hash ta- 
ble (Content id), and where the second column repre- 
sents the bit array introduced above for the content in 
the same line. As example, this structure can hold the 
information which content is on which surrogate server 
in less than 1 22 Mbytes for 1 ,000 surrogate servers and 
1 ,000,000 different content items. The calculation is the 
following. If we have 1 ,000 surrogate servers, we need 
a 1,000 bits bit array, thus 125 Bytes. If we have 
1,000,000 different contents, 3 Bytes are enough to 
identify each content. In summary, the size of this struc- 
ture is 1'000'000*(125+3) = 122 Mbytes. We note that 
this table could be easily compressed. Indeed, several 
content items could be present in the same set of sur- 
rogate servers. In figure 5, we see this is the case for 
contents (C 2 , C 4 , C 6 ) and (C 15 C 3 , C 5 ). In this example, 
for 6 contents we only need 2 entries. By removing all 
the duplicate bit arrays in the hash table, and by letting 
several contents point to the same line, we can signifi- 
cantly reduce the size of the table. When a new content 
is created, only one bit array needs to be added in the 
content lookup structure. When a surrogate server is 
added or removed, all the bit arrays must be modified. 
However, this update can be done line by line. 
[0070] The size in memory of the table of figure 5 is 
in 0(C*S) where S is the number of surrogate servers 
and C is the number of real contents. The lookup speed 
of this algorithm cannot be stated without a modeling of 
the distribution of the content among the surrogate serv- 
ers. This lookup speed is empirically shown to be fast - 
a few number of lookups in the data structures. 
[0071] To summarize, the overall complexity and 
memory sizes required for carrying out the process are 
reduced, and the process may be implemented in a 
standard PC. 

[0072] Several examples of use of the process are 



now discussed. In the preferred embodiment, the in- 
vented method is applied to the architecture shown in 
figure 6. It shows the Internet 58, with a number of end 
users 60 and surrogate servers 62. The CDN further 
5 comprises a redirection server 64 and several proxies 
66, 68 for the redirection server. All the algorithms de- 
scribed above may be implemented in the redirection 
server 64. The redirection server proxies 66, 68 aims to 
increase the number of request per second handled by 
the system. Indeed, the invented method is able to han- 
dle several hundred of thousand of request per seconds 
on standard PCs, but as the requests are HTTP based, 
the overhead induced by this protocol could make the 
number of requests drop down to only some thousands 
of requests per second. The idea beyond the redirection 
server proxy is to put the burden of the HTTP request 
handling in a large number of proxies, and that each 
proxy asks for the best adapted content and the optimal 
server that can serve this content to the redirection serv- 
er with a simple and fast protocol. For instance, a per- 
manent TCP connection can be opened between each 
proxy and the server, and each request is embedded 
within a TCP packet. The proxies can also be in charge 
of finding the information (device speed, profile, etc.) of 
the requesting end user. This way, all the overhead is 
pushed to the proxies that can be easily replicated, and 
the server is dedicated to perform the invented method. 
[0073] Figure 6 further shows an example of a request 
of an end user. We first consider that the end user has 
found a link to a content he wants to receive on a web 
server (not shown in the figure). So the end user has an 
HTTP link to a meta-content, which points to the redi- 
rection server proxies. In order to maintain only one link 
in the web page and to distribute the load among all the 
proxies, and because each proxy has a different IP ad- 
dress, two solutions can be considered : 

a layer 4/7 switch can be configured in front of the 
redirection server proxies in order to load balance 
all the requests it receives among the redirection 
server proxies. Thus, the address put in the HTTP 
link will the address of the layer 4/7 switch; 
a DNS round robin redirection may be used. In this 
case, a virtual redirection server hostname must put 
in the HTTP link, so that during the DNS name res- 
olution of the virtual redirection server hostname, 
the DNS returns the IP address of one of the redi- 
rection sever proxy chosen in a round robin manner. 

[0074] For the sake of simplicity, we do not represent 
the layer 4/7 switch or the DNS redirection in figure 6, 
and simply assume that the HTTP link points to one of 
the redirection server proxies. The end user sends an 
HTTP request to the redirection server proxy - step 70 
in figure 6: HTTP://redirection_server_proxy_address/ 
meta-content_id. When the proxy receives this request, 
it extracts the IP address of the end user and the id of 
the requested meta-content. Additionally, the redirec- 
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tion server proxy can retrieve further information on the 
profile of the end user. This information can be obtained 
by several ways : 

it can be included in the HTTP link returned by the 
web server and extracted by the redirection server 
proxy; 

it can be collected by the redirection server proxy 
when the end user sends the request to the redirec- 
tion server. This information can be retrieved using 
cookies, or any other way. 

[0075] Once the redirection server proxy has the IP 
address of the end user, the meta content id and even- 
tually additional information on the end user (depending 
on the kind of meta-content lookup expected), the proxy 
sends a request -step 72 in figure 6 -with the IP address 
of the end user, the meta-content id and with the addi- 
tional information on the end user. The redirection serv- 
er performs the subnetwork lookup, the meta-content 
lookup, and the surrogate server lookup as specified 
above, and returns to the proxy - step 74 in figure 6 - the 
best adapted real content and the optimal surrogate 
server that can serve this real content. The redirection 
server proxy returns to the end user - step 76 in figure 
6 - an HTTP redirect with the following URL: 

HTTP://optimal_surrogate_server_address/ 
best_adapted_real_content_id. 

Finally, the end user connects to the optimal surrogate 
server asking for the best-adapted real content - step 
78 in figure 6. 

[0076] This notion of redirection server proxy allows 
a feature that we call meta-file generation handling. Let 
first introduce the notion of meta-file. In order to allow 
for more complex meta-content management, the no- 
tion of meta-file was introduced. A metafile can be as 
example a HTML document, an ASX file (metafile for 
Windows Media), a SMIL file (metafiles for RealNet- 
works), or any other file that contains URLs for meta- 
contents embedded within other content. 
[0077] With a URL for a meta-content, an end user 
will receive only content. With a URL for a meta-file, the 
end user will receive a set of contents, which allows for 
complex formatting. For instance, in case the meta-file 
is an HTML document, each meta-content embedded 
within this HTML document can be flash animation. In 
case the meta-file is an ASX file, each meta-content em- 
bedded within this ASX file can be music clip. 
[0078] As an example of request of an end user, let 
see the example in figure 6 in the case of meta-file han- 
dling. The meta-file id is referenced in a URL in a web 
page that the end user accesses : 

HTTP://redirection_server_proxy_address/meta- 

filejd 

The end user sends an HTTP request to the redirection 
server proxy - step 70 in figure 6 : 

HTTP://redirection_server_proxy_address/ meta- 

file id 



When the proxy receives this request, it downloads from 
a database a template for this meta-file. The template 
is similar to the metafile itself except that there are tags 
to identify where the surrogate server address must be 

5 included and tags to give the meta-content id and show 
where the real content id must be included (in fact as 
replacement of the meta-content id). The redirection 
server proxy extracts the IP address of the end user and 
extracts from this template all the meta-content ids. Ad- 

10 ditionally, the redirection server proxy can retrieve fur- 
ther information on the profile of the end user. 
[0079] Once the redirection server proxy has the IP 
address of the end user, the meta content ids and even- 
tually additional information on the end user (depending 

15 on the kind of meta-content lookup expected), the proxy 
sends one request per meta-content id - step 72 in figure 
6 - with the I P address of the end user, the meta-content 
id and with the additional information on the end user. 
The redirection server performs the subnetwork lookup, 

20 the meta-content lookup, and the surrogate server 
lookup as specified above, and returns to the proxy - 
step 74 in figure 6 - the best adapted real content and 
the optimal surrogate server that can server this real 
content. Once the redirection server proxy has the best 

25 adapted real content id and the optimal surrogate server 
for each meta-content id, it replaces each tag in the me- 
ta-file template with the relevant information and returns 
the generated meta-file to the end user - step 76 in figure 
6. Finally, the end user connects for each real content 

30 to the optimal surrogate server, asking for the corre- 
sponding best adapted real content, as shown in step 
78 in figure 6. 

[0080] The notion of meta-file has numerous applica- 
tions. For instance: 

35 

playlist: the meta-file is a list of content to be played. 
For instance, an end user can click on a link disco 
and he gets a meta-file containing 20 famous disco 
video clips; 

40 - advertisement insertion: the meta-file contains the 
meta-content the end user asked for, preceded by 
a meta-content that is an advertisement. This ad- 
vertisement can either be static, i.e. the same for all 
the end user, or dynamic, i.e. changed by the meta- 

45 content lookup according to information on the end 
user. 

Example 1 . 

50 [0081] The embodiment now discussed exemplifies 
an object for the Meta-Content Lookup: the Economic 
broadband Object, which may be used for streaming 
media. The economic broadband object allows solving 
the following problems : 

55 

a provider wants to give a higher quality to its clients 

than to the other clients of other providers; 

a provider does not want do deliver a high quality 
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content to a user far from the closest surrogate serv- 
er that has this content, because he will experience 
poor quality due to the distance and will create con- 
gestion in the network. Instead the provider wants 
to send to him a low quality content; 
a provider does not want an end-user with a 
56.6Kbit/s modem to receive a stream at 300 Kbit/s. 

[0082] The object defined reaches the following scal- 
ability targets: Millions of contents; Thousands of re- 
gions; Four levels of quality for each content (0 is the 
lowest quality or no access to the content, and 3 is the 
highest quality); Four types of devices (we do not mean 
that we cope with only 4 devices, but that all devices can 
be mapped into four categories). The category 0 can be, 
for instance, for all the devices with a maximum through- 
put higher or equal than 28.8Kbit/s. Then, for instance, 
category 1 could be for all the devices with a maximum 
throughput higher or equal to 56.6Kbit/s and for all the 
unknown devices. The category 2 could be for all the 
devices with a maximum throughput higher or equal to 
300Kbit/s, and the category 3 could be for all the devices 
with a maximum throughput higher or equal to 700Kbit/ 
s. Moreover, we assume that all the contents for device 
type 1 can also be read by device type 2, and 3; all the 
contents for device type 2 can also be read by device 
type 3. This assumption is always verified when an in- 
creasing type refers to an increasing bandwidth capacity 
of the device. 

[0083] For the economic broadband object, the notion 
of region may be defined. A region is a set of subnet- 
works. In the preferred embodiment, the regions are 
maintained at the level of the structure that stores the 
subnetwork, in our embodiment the binary tree. At the 
initialization of the structure, each subnetwork is as- 
signed to a default region. Then, the media manager or 
CDN manager defines a new region and the new region 
id is assigned to the corresponding subnetworks. During 
a route lookup, the route corresponding to the end user 
IP address is returned, but also the corresponding re- 
gion id. 

[0084] The economic broadband object returns the 
real-content id based on the meta-content id, the device 
type of the end user, and the region id. The structure of 
this object is shown in figure 7. This object contains 
three arrays: The first array is the Region array; the sec- 
ond array is the Device array; the third array is the real 
content array. We describe the actions performed by the 
economic broadband object step by step : 

the input of the meta-content lookup when the eco- 
nomic broadband object is called is: the IP address 
of the end-user IP A , the meta-content id he wants 
to receive MC A , its device type D A , and the region 
R A corresponding to the IP address IP A of the end- 
user. This device type could be computed by the 
redirection server proxy when it receives the re- 
quest from the end-user; 



a lookup in the first array of the object is done. This 
array contains for each region, the required level of 
quality for the given meta-content. This lookup re- 
turns a 2 bits level of quality, Q R , for the meta-con- 
5 tent requested according to the region; 

then, according to the device type a lookup is made 
in the second array of the object that contains for 
each device the highest possible level of quality. 
This lookup returns a 2 bits, Q, that is the highest 
10 possible level of quality for this meta-content id ac- 
cording to the device of the end-user. If Q R is lower 
than Q (Q R <Q), then a last lookup is made (see step 
4), otherwise (Q R >Q) the lookup ends here, and the 
end-user cannot receive the stream. In order to 
15 avoid a missing real content in the output of the 
lookup, which can make the whole meta-file fail due 
to a wrong URL, a generic real content can be re- 
turned. For instance, a real content that contains the 
message: "Sorry, your device does not handle the 
20 encoding rate of the requested content"; 

finally, a last lookup is made in the third array of the 
object. This array contains the real-content id ac- 
cording to each level of quality. This lookup process 
is the following: Q R gives the column in the array. 
25 The lookup on this array returns the real-content id 
according to the region and to the device. However, 
in case this content is not available in the surrogate 
server (for instance, because the only one surro- 
gate server that has this content is down), a generic 
30 real content can be returned, in order to avoid a 
missing real content in the output of the lookup, 
which can make the a whole meta-file fail due to a 
wrong URL. For instance, a content that contains 
the message: "Sorry, the requested content is cur- 
35 rently not available". 

[0085] Now we give an evaluation of this object. The 
first entry is an array with as many cells as regions, each 
cell containing a 2 bits flag that gives the quality for this 

40 content according to the region. The second entry is an 
array with four cells, one per category of devices. Each 
cell contains a 2 bit flag that gives the highest level of 
quality that this category of device can handle. The third 
entry is an array with four cells, each cell containing a 

45 real-content id according to the level of quality corre- 
sponding to this cell. For 3000 regions, and 8 bytes for 
the meta-content id, the size of this object is: 3 
000*2+4*2 + 4*8*8=783 Bytes. For 1 million of meta- 
contents with economic broadband the size of all the ob- 

50 jects will be: 783 MBytes. We note that this table could 
be easily compressed. Indeed, several objects will have 
exactly the same region array and the same device ar- 
ray. This correlation is due to the management of the 
content. It makes no sense to define completely different 

55 scenarios for each content. The easiest way to proceed 
is to define sets of contents that will have exactly the 
same configuration (i.e. same level of quality, same 
quality per region, same quality per device). The way to 
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handle the compression is very simple. We define two 
arrays: a compressed region two-dimensional array, 
and a compressed device two-dimensional array. When 
several objects have the same region array, we create 
a new line in the compressed region two-dimensional 
array, and we replace the region array in the objects by 
a pointer that points to the related line in the compressed 
region two-dimensional array. We do exactly the same 
for the device array. 

[0086] To find the real-content id according to the ob- 
ject, the region and, the device requires only 3 lookups 
in an array (one lookup per entry). We note that the in- 
vented object can be restricted to a region only or to a 
device only lookup. 

[0087] The general economic broadband object gives 
a structure that is very flexible and allows for a large 
variety of configurations. We give simple examples of 
potential applications. We first note that quality among 
the real-contents for each meta-content id is uncorrect- 
ed. Indeed, in the example of figure 7, we can define C 1 
as a56Kbit/s content, C 18 as a 300Kbit/s, C 17 as a 700 
Kbit/s, and C 2 as a 1 .5Mbit/s. But we can also define C 1 
as a sorry message at 56Kbit/s, C 18 as a 10 seconds 
trailer at 56Kbit/s, C 17 as a 3 minutes trailer at 56 Kbit/ 
s, and C 2 as the full movie at 1 .5 Mbit/s. We also see 
that the quality of each content is independent of the 
category of the devices. Therefore, the content provider 
has a total freedom when defining its real-contents, the 
only one requirement is to define a level of quality for 
each real-content, which is relative to the other real-con- 
tents for the same meta-content id. 
[0088] In figure 8, we give an example of configuration 
for four meta-content ids in case of economic broad- 
band. We assume, for this example, that the four cate- 
gories of devices are defined as: 0>56Kbit/s, 1>300 
Kbit/s, 2>700 Kbit/s, 3>1 .5 Mbit/s. We give a description 
of the configuration for each meta-content id : 

for MC 1? four levels of quality are defined as there 
are four real-contents (see figure 8). We define C 1 
as a 56 Kbit/s content, C 18 as a 300 Kbit/s, C 17 as 
a 700 Kbit/s, and C 2 as a 1 .5Mbit/s. MC 1 represents 
the general case, 

for MC 2 , four levels of quality are defined as there 
are four real-contents (see figure 8). We define 
as a sorry message at 5 6 Kbit/s, C 3 as a 1 0 second 
advertisement of a movie at 56Kbit/s, C 4 as a 3 
minute advertisement of a movie at 56 Kbit/s, and 
C 20 as the full movie at 1.5 Mbit/s. MC 2 can repre- 
sent a case where a provider does not want to give 
a content to end-users that are not in its own net- 
work, but wants to advertise for this content outside 
of its network without using a lot of resources (only 
short 56 Kbit/s streams); 

for MC 3 , two levels of quality are defined (0 and 1 ). 
We define C 20 as a 28.8 Kbit/s sorry message, and 
C 7 as a 300 Kbit/s stream. MC 2 can represents a 
case where a provider does not want to give a con- 



tent to end-users that are not in its own network; 
for MC 4 , two levels of quality are defined (see figure 
8). We define C 8 as a56Kbit/s movie, and C 16 as a 
700 Kbit/s movie. MC 4 can represents a case where 
5 a provider just want to give a high quality stream to 

its end-user, and a low quality stream to end-users 
that are not in its own network. 

[0089] This embodiment makes it possible for the 
10 CDN to adapt the media effectively delivered to the user, 
according to the user information. 

Example 2. 

15 [0090] The embodiment now discussed exemplifies 
an object for the Meta-Content Lookup: the Advertise- 
ment insertion Object. These advertisement insertion 
objects allow solving the following problems : 

20 - a provider wants to add a static advertisement to a 
content, i.e. an advertisement that will be independ- 
ent of the location of the end-user, 
a provider wants to add a regionalized advertise- 
ment to a content, i.e. an advertisement that de- 
25 pends on the location of the end-user; 

a provider does not want an end-user with a 
56.6Kbit/s modem to receive an advertisement en- 
coded at 300 Kbit/s; 

a provider wants to define only one advertisement 
30 that maps to a carousel of real advertisements, 
each real advertisement being served in a round 
robin manner. This way, when this kind of advertise- 
ment is associated to a content, an end-user will not 
see the same advertisement each time he will re- 
35 quest for the content; 

a provider wants to be able to guarantee the number 
of view or the percentage of view for each adver- 
tisement. 

40 [0091] The objects defined reaches the following scal- 
ability targets: Thousands of ads, each ad correspond- 
ing to a different real advertisement for each region (the 
number of different real advertisement is, if we do not 
have static advertisements, the number of ads time the 
45 number of regions); Thousands of regions; Four types 
of devices (the explanation of the devices definition is 
the same as for the economic broadband object). 
[0092] In the following, we will introduce three objects 
for the advertisement insertion : 

50 

the regionalized advertisement object; 
the carrousel of advertisement object; 
the per-view billing carrousel of advertisement ob- 
ject. 

55 

[0093] For the advertisement insertion objects, the 
notion of region is defined as in the case of the economic 
broadband object. 
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[0094] The advertisement insertion objects return the 
real-content id based on the meta-content id, the device 
type of the end user, and the region id. The input of the 
meta-content lookup when the advertisement insertion 
object is called is: the IP address of the end-user IP A , 
the meta-content id he wants to receive MC A , its device 
type D A , and the region R A corresponding to the IP ad- 
dress IP A of the end-user. This device type could be 
computed by the redirection server proxy when it re- 
ceives the request from the end-user. 
[0095] We introduce three objects: figure 9 shows the 
regionalized advertisement object; figure 10 shows the 
carousel of advertisement object; figure 11 shows the 
per-view billing carousel of advertisement object. In the 
following, we describe step by step the actions per- 
formed by each object: 



probability of views for each advertisement, let say 
50% for MC 1? 30% for MC 2 , and 20% for MC 3 . 
Count-/ can be set to 5, Count 2 can be set to 3, and 
Count 3 can be set to 2, as 50, 30, and 20 are mul- 
5 tiple of 10. We note that the setting of these 

counters is done by the media manager according 
to the request of the advertisers. Then, a pointer is 
set to the advertisement to be returned. At the ini- 
tialization, the pointer is set to the first advertise- 
10 ment. When an advertisement is returned, let say 
MC 1? Count-, and Count'-, are decreased by one. If 
Count-, reaches 0, the pointer is set to the next ad- 
vertisement, and Count-, is re-set to its initial value. 
When Count'-, reaches 0, the corresponding entry 
15 is removed from the carousel. With this object, we 
are able to guarantee a mean probability of views 
among the advertisements of the same object, and 
a maximum number of views for each advertise- 
ment. We do not further explore this issue. This 
problem could be further studied in case a client 
needs for a solution to the per-view billing that can- 
not be solved by the regular carousel of advertise- 
ments. 

[0096] In case the real content returned by the adver- 
tisement insertion object cannot be found in any surro- 
gate servers, a missing real content in the output of the 
lookup can happen, which can make a whole meta-file 
fail due to a wrong URL. In order to avoid this problem 
a generic real content can be returned when the regular 
lookup failed. 

[0097] Now, we give an evaluation of the objects 
introduced : 

the regionalized advertisment object is a 2 dimen- 
sional array. Each cell contains the id of a real-con- 
tent according to the region and to the device. For 
1 0 000 regionalized advertisements, 8 bytes per id, 
and 1000 regions, the size of this table is: 10 
000*4*1000*8 = 320 MBytes. To find the real-con- 
tent id according to the object, the region, and the 
device requires only 1 lookup in an array; 
the carousel of advertisement object is an array. 
Each cell contains the id of a meta-content. For 10 
000 carousel, 1000 meta-content ids per carousel, 
8 bytes per id, the size of this table is: 10 
000*1000*8 = 80 MBytes. When a new meta-con- 
tent id is added, removed, or modified, the update 
in this array is very efficient and simple by adding, 
removing or modifying a cell in an array; 
the per-view carousel of advertisement has proper- 
ties similar to the carousel of advertisement object 
except that for each meta-content we add 2 
counters, a 1 byte counter for the probability of 
views, and a 4 bytes counter for the number of 
views. 

[0098] The embodiment solves the problems dis- 



if the object is a regionalized advertisement object, 
a lookup in the array structure is done. This array 
contains for each region and for each device the re- 20 
al-content id. We note that the media provider does 
not need to specify a different real-content id per 
region. In figure 9, we see that only 3 real-content 
ids are specified. However, in case of static adver- 
tisement, the meta-content id can simply refer to the 25 
identity object. Then, according to the region R A and 
to the device of the end-user D A , the real-content id 
is returned; 

if the object is a carrousel of advertisement object 
(see figure 1 0), a lookup in the array of the structure 30 
is done. This array contains a set of meta-content 
ids and a pointer pointing to the next meta-content 
id to be returned. The meta-content id pointed by 
the pointer is returned. After a carousel lookup, the 
pointer is shifted on the next meta-content id in the 35 
array. If this position is the last in the array, the point- 
er is shift to the beginning of the array. The meta- 
content ids for a given carousel are returned with a 
round robin policy. We note that in an array, chang- 
ing the distribution of meta-content ids will change 40 
the distribution of the number of views for each ad- 
vertisement. For instance, in figure 10, MC 5 will ex- 
perience 3/4 of the views whereas MC 2 will experi- 
ence 1/4 of the views. This property is at no cost for 
the redirection server and could be exploited by the 45 
media manager in order to guarantee a given 
number or percentage of views to the advertisers. 
Once the meta-content is returned by the carousel 
lookup, this meta-content is used for a new meta- 
content lookup; 50 
if the object is a per-view billing carousel of adver- 
tisements (see figure 11) a lookup in this object is 
done. Two counters are maintained for each meta- 
content: The first one allows enforcing some prob- 
ability of views; the second one sets the maximum 55 
number of views for the corresponding advertise- 
ment. Imagine that in addition to the above request- 
ed number of views, we also want to enforce a given 
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cussed above. 

[0099] The process also applies where the network 
comprises a private network, that is a network, the ad- 
dresses of which are hidden from the rest of the network. 
This may be the case where the requesting end user is 
beyond a Network Address Translator (NAT), a proxy, 
or another network element that hides its IP address. In 
this case, it is not possible to locate such an end user 
with a finer granularity than the IP address of the hiding 
network element. In case several surrogate servers are 
also beyond the same network element, it is not possible 
for the redirection server to find the optimal surrogate 
server. We call a network beyond such a hiding network 
element a private network. All the addresses inside a 
private network are only available to other computer in- 
side the same private network. We call the network el- 
ements that allow reaching the outside of the private net- 
work the private networks interfaces (PNI). 
[0100] The private IP addresses of all the computers 
inside the private network are hidden from the outside 
of this private network. The invented method includes a 
solution to the problem of the localization of an end user 
inside a private network based on local redirection serv- 
ers. This solution also works in case of nested private 
networks. The key of the solution is to place a local re- 
direction server inside each private network that con- 
tains at least one surrogate server. In case a private net- 
work does not contain a surrogate server, it does not 
make sense to place a local redirection server inside it 
as this private network can be considered as a whole, 
and all the end users inside this private network will be 
redirected to the closest - in relation to the PNI - surro- 
gate server outside of the private network. However, in 
case it makes sense to redirect the end user inside the 
private network to different surrogate servers outside of 
the private network, according to the localization of the 
end user (for instance, when the private network spans 
over several countries), a local redirection server must 
be placed inside this private network. 
[0101] In the preferred embodiment, the solution to 
use the invention at the same time within and outside 
private networks uses the following mechanism. 
[0102] Each time a local redirection server is installed, 
the subnetwork corresponding to the private IP address- 
es is declared in the list of subnetworks. As private IP 
addresses are only unique within the private network, a 
unique identifier for each private network must be de- 
fined in order to avoid collision inside the list of subnet- 
works. In the preferred embodiment, we choose as 
unique identifier the IP address of the network element 
(NAT, gateway, etc.) that defines this private network. 
The main advantage of this method is that we can con- 
catenate this public IP address with the subnetwork in 
order to obtain global and unique addresses of all sub- 
networks, and still use the same subnetwork lookup al- 
gorithm. In case there are several PNIs for the same 
private network, the private subnetwork is declared for 
all these PNIs. 



[01 03] When the redirection server receives a request 
from an end user inside a private network, it searches 
whether there is a local redirection server inside this pri- 
vate network. If there is no local redirection server, the 

5 end user localization is the same as for the network el- 
ement that defines the private network. If there is a local 
redirection server inside this private network, the redi- 
rect server does an HTTP redirect of the end user to the 
local redirection server; when the local redirection serv- 

10 er receives the request from the end user, it searches 
whether this request comes from a PNI inside its own 
private network. If this is the case, the local redirection 
server redirects the end user to the redirection server of 
the nested subnetwork. This process occurs recursively 

15 until a local redirection server finds that the end user 
comes from its own private network. Then, the local re- 
direction server sends a request to the redirection server 
with the IP address of the end user. The redirection serv- 
er finds the best adapted real content and the optimal 

20 surrogate server according to the real location of the end 
user. Then the redirection server returns the meta-file to 
the local redirection server that forwards this meta-file 
to the end user. 

[0104] In an alternative embodiment, the local redi- 
25 rection server within a private network carries out the 
complete process, and contacts the redirection server 
outside the private network only in the case, when the 
requested content is not available on a local surrogate 
server. This setup assumes that the user makes the in- 
30 itial request to the local redirection server; if the Web 
page with the URL is hosted outside the private network 
this is possible by using different local DNS entries for 
the redirection server that returns for local users the IP 
address of the local redirection server. 
35 [0105] The invention was discussed above, without 
any specific indication of the practical implementation of 
the CDN. One may use for implementing the CDN, and 
notably the redirection servers, any of the solutions 
known in the art. The algorithms discussed in the vari- 
40 ous steps of the invention may be carried out in the re- 
direction server, since this is the location where the in- 
formation is easily available. The redirection server can 
be placed in a central location of the CDN, but may also 
be replicated among different locations. An example 
45 would be a global CDN with redirection servers on each 
continent to minimize latencies.. The algorithms may be 
programmed with known methods, based on the infor- 
mation given above. 

[0106] The invention has the following advantages 
50 over the prior art. As compared with DNS redirection 
method, the invention offers the following advantages : 

optimal: The accuracy of DNS redirection methods 
relies on the placement of the local DNS server for 
55 each requestor. The DNS server used by a partic- 
ular end-user may be distant within the network to- 
pology from the end-user resulting in wrong as- 
sumptions for the redirection engine; 
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end-user data: DNS redirection methods do not rec- 
ognize the end-user or the end-user network; only 
the DNS server used by the end-user is known. The 
redirection itself can only redirect the request to a 
different server. In contrast, the described invention 
works on the application level and can use all data 
known from the end-user to formulate the response 
(and thus include as example different advertising 
for different end-users). For instance, US-A-60108 
703 uses dynamic DNS redirection to select a sur- 
rogate server close to the end-user. The main dis- 
advantage of DNS redirection is that the result 
might not be accurate for the requester, since the 
DNS server used by the end-user's browser is not 
close to the end-user. Moreover, with dynamic DNS 
it is not possible to get the context of the end-user. 

As compared with load balancing methods, the inven- 
tion offers better quality: the goal of load-balancing 
methods is to distribute end-user requests equally 
among several servers, thus optimizing the overall re- 
sources of the servers. In contrast, the described inven- 
tion chooses the server that is most likely to deliver the 
best quality to the end-user over a period of time; the 
quality criteria is the distance from the server to the end- 
user. If several replica servers have the same distance 
to the end-user, the described method load-balances 
among them. 

[01 07] The invention is also advantageous over tradi- 
tional HTTP redirection methods: HTTP redirection 
methods compute the best server to satisfy a given re- 
quest and respond with an HTTP redirection directive. 
The end-user's browser then connects to the URL given 
in the HTTP redirection directive. In contrast to the HT- 
TP redirection methods, this invention responds with the 
expected file, in which the URLs to the requested con- 
tents have been substituted with the URLs to the optimal 
replica servers. 

[0108] In addition, contrary to the solution proposed 
in US-A-6 003 030, the invention does not relies on spe- 
cial software that is installed at the end-user terminal, 
to chose the best surrogate server. 
[0109] The present invention can serve directly the 
best-adapted resource to the end-user, but with replace- 
ments for any hyperlinks to other resources to obtain 
best responses for those embedded resources. There 
is no need for the user to connect first to the to the replica 
router and then a second time to the replica server, as 
discussed in US-A-6 052 718; this avoids unnecessary 
latency. Contrary to US-B-6 185 598, the invention 
serves the resource with adapted resource locators de- 
pending on the end-user's and service provider's con- 
text, without needing to intercept resource requests. 



Claims 

1. A process for selecting, in a content delivery net- 



measuring a distance between each surrogate 
server and each subnetwork; 
according to the subnetwork of the user, select- 
ing a surrogate server with the smallest dis- 
tance. 

40 4. The process of claim 3, wherein a distance between 
a surrogate server and a subnetwork is selected 
from : 

a number of hops within the network from the 
45 surrogate server and a representative of the 

subnetwork; 

a round trip time within the network from the 
su rrogate server to a representative of the sub- 
network; 

50 - a parameter entered manually or read from a 
connection table; 

available or bottleneck bandwidth within the 
network from the surrogate server and a repre- 
sentative of the subnetwork; 
55 - costs to reach a representative of the subnet- 
work from the surrogate server. 

5. The process of claim 3 or 4, wherein distances are 



work having at least two surrogate servers (62) de- 
livering content over a network, a surrogate server 
adapted to deliver content requested by a user (60), 
the process comprising the steps of: 

5 

defining at least two subnetworks in the net- 
work, and upon request of the user, finding (4) 
a subnetwork of the user; 
determining (8) the surrogate server closest to 
10 the subnetwork of the user; 

checking (10) whether the closest surrogate 
server has the ability to serve the content re- 
quested by the user, and 

15 . if the closest surrogate server has the abil- 

ity to serve the content requested by the 
user, selecting the closest surrogate serv- 
er; 

else, if the closest surrogate server does 
20 not have the ability to serve the content re- 

quested by the user, disregarding (14) the 
closest possible surrogate server and re- 
peating the step of determining and check- 
ing. 

25 

2. The process of claim 1 , wherein the step of finding 
a subnetwork is carried out using an algorithm with 
a time complexity equal to or less than O(W), with 
W the length of an address in the network. 

30 

3. The process of claim 1 or 2, wherein the step of de- 
termining comprises 
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stored in a table with a subnetwork identifier as key. 

6. The process of claim 5, wherein an entry in the table 
is a list of surrogate servers identifiers, ordered by 
distance to the subnetwork identified by the key to 
the entry. 

7. The process of one of claims 1 to 6, wherein the 
step of determining is carried out using an algorithm 
with a space complexity equal or less than 0(N* 
(S+1 )), with N the number of subnetworks and S the 
number of surrogate servers. 

8. The process of one of claims 1 to 7, wherein the 
step of checking comprises 



past behaviour of the user; 
a subnetwork of the user; 
identity of the user. 

5 15. The process of claim 12, 13 or 14, wherein the real 
contents of the set are representing the same con- 
tent with a different video quality or a different audio 
quality. 

10 16. The process of one of claims 12 to 15, wherein the 
real contents of the set are representing the same 
content in different languages. 

17. The process of claim 12, 13 or 14, wherein the real 
15 contents of the set comprise advertisements. 



storing for each surrogate server identifiers of 
the content contained in the surrogate server; 
mapping an identifier of the content requested 
by the user; and 

comparing the identifier of the content request- 
ed by the user and the identifiers of the content 
contained in the closest surrogate server. 

9. The process of claim 8, wherein the identifiers are 
stored in a table with the content identifier as key. 

10. The process of claim 9, wherein each entry in the 
table is a bit array, one bit in the array being repre- 
sentative of the presence in a surrogate server of 
the content corresponding to the key to the entry. 

1 1 . The process of one of claims 1 to 1 0, wherein the 
step of checking is carried out using an algorithm 
with a space complexity equal or less than 0(C*S), 
with C the number of content and S the number of 
surrogate servers. 

12. The process of one of claims 1 to 11 , wherein a re- 
quest from a user contains an identifier of a set of 
real contents, each of said real content being con- 
tained in the content delivery network, wherein the 
process further comprises, before the step of 
checking, a step of selecting a real content of the 
set according to the request of the user, and where- 
in the step of checking comprises checking whether 
the closest surrogate server has the selected real 
content. 

13. The process of claim 12, wherein the step of select- 
ing a real content is carried out according to infor- 
mation related to the user. 

14. The process of claim 13, wherein said information 
related to the user is selected among: 

technical characteristics of a device of the user; 
a location of the user; 



18. The process of claim 16, wherein the real contents 
of the set comprise regionalized advertisements, 
personalized advertisements or independent ad- 

20 vertisements. 

19. The process of one of claims 1 to 18, wherein the 
network comprises a private network, addresses of 
which are hidden from the rest of the network, said 

25 private network containing at least one surrogate 
server, the process comprising 

providing a local redirection server within the 
private network; 
30 - carrying the steps of defining, determining and 
checking within the local redirection server, and 
if no surrogate server is selected within the pri- 
vate network, repeating the steps of defining, 
determining and checking within a redirection 
35 server of the content delivery network outside 

of the private network. 

20. The process of one of claims 1 to 19, wherein the 
network comprises a private network, addresses of 

40 which are hidden from the rest of the network, said 
private network containing at least one surrogate 
server, the process comprising 

providing a local redirection server within the 
45 private network; 

upon request of the user, having the local redi- 
rection server forward an address of the user 
in the private network to a redirection server of 
the content delivery network outside of the pri- 
50 vate network. 

21. The process of claim 20, wherein the steps of de- 
fining, determining and checking are carried out in 
the redirection server outside of the private network. 

55 

22. A process for selecting, in a content delivery net- 
work having at least two surrogate servers deliver- 
ing content over a network, a surrogate server 
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adapted to deliver content requested by a user, the 
process comprising the steps of: 

defining at least two subnetworks in the net- 
work, and upon request of the user, finding a 5 
subnetwork of the user; 
determining the surrogate servers having the 
ability to serve the content requested by the us- 
er; 

selecting among the surrogate servers thus de- 10 
termined the surrogate server closest to the 
subnetwork of the user. 



23. A computer program product adapted to carry out 

the process of one of claims 1 to 22. 15 

24. A content delivery network with a redirection server 
and at least one surrogate server, the redirection 
server comprising computer programs means for 
carrying out the process of one of claims 1 to 22. 20 
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